Picture for Jinyang Wu

Jinyang Wu

Learning to Adapt SFT Data for Better Reasoning Generalization

Add code
May 26, 2026
Viaarxiv icon

Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles

Add code
May 21, 2026
Viaarxiv icon

Implicit Hierarchical GRPO: Decoupling Tool Invocation from Execution for Tool-Integrated Mathematical Reasoning

Add code
May 18, 2026
Viaarxiv icon

Self-Distilled Agentic Reinforcement Learning

Add code
May 14, 2026
Viaarxiv icon

RobotEQ: Transitioning from Passive Intelligence to Active Intelligence in Embodied AI

Add code
May 07, 2026
Viaarxiv icon

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Add code
Apr 02, 2026
Viaarxiv icon

GUI-CEval: A Hierarchical and Comprehensive Chinese Benchmark for Mobile GUI Agents

Add code
Mar 16, 2026
Viaarxiv icon

Quantizer-Aware Hierarchical Neural Codec Modeling for Speech Deepfake Detection

Add code
Mar 10, 2026
Viaarxiv icon

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Add code
Feb 05, 2026
Viaarxiv icon

Exploring Knowledge Purification in Multi-Teacher Knowledge Distillation for LLMs

Add code
Feb 01, 2026
Viaarxiv icon